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(57) Abstract 



The invention provides a recombinant virus vector for use as an immunotherapeutic or vaccine. The recombinant vims vec- 
tor comprises at least one pair of nucleotide sequences heterologous to the virus and which have sufficient sequence homology 
that recombination between them might be expected. The pair of nucleotide sequences are arranged in the virus vector such that 
they are inverted with respect to each other. The virus vector is able to infect a mammalian host cell and express as polypeptide 
the heterologous nucleotide sequences in the host cell. For infection thought to be caused by HPV infection, the pair of nucleotide 
sequences encode part or all of human papillomavirus (HPV) wild-type proteins or mutant proteins immunologically cross-reac- 
tive therewith. For an immunotherapeutic or vaccine against cervical cancer, the recombinant virus vector encodes part or all of 
the HPV wild-type proteins HPV16E7 and HPV18E7 or mutant proteins immunologically cross-reactive therewith. 
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RECOMBINANT VIRUS VECTORS ENCODING HUMAN PAPILLOMAVIRUS PROTEINS 
This invention relates to recombinant virus vectors. 
In particular, it relates to recombinant virus vectors 
designed to overcome the problem of recombination 
5 between homologous nucleotide sequences. It also 
relates to recombinant virus vectors encoding human 
papillomavirus proteins; to iramuno therapeutics and 
vaccines for conditions associated with HPV infection; 
to the production of a virus (e.g. vaccinia virus) 
10 engineered to express antigens encoded by human 
papillomavirus types 16 and 18 and to 
immunotherapeutics and vaccines for cervical cancer. 

In recent years, strong evidence has been adduced 
for a link between cervical carcinoma and infection 
15 with certain types of human papillomavirus (HPV), 
particularly with types 16, 18, 31, 33 and 35 (Gissman 
et al.. Cancer Cells 5,275, 1987). This is based on 
hybridisation studies which have indicated that more 
than 85-90% of biopsies from cervical tumours can be 
20 shown to contain papillomavirus DNA. HPV16 DNA is 
most commonly found (in about 60% of tumours) with 
HPV18 the next most frequent (about 20%) and the other 
types accounting for a further 5-10%. In many 
instances, tumour cells from the biopsies do not 
25 however, contain the complete genome, but rather a 
deleted form. The extent and location of the deleted 
information within the virus genome is variable, but 
a general feature is the retention of the part of the 
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genome encoding the E7 protein ( Schwarz et al . , Nature 
314, 111, 1985). In addition, the adjacent 

E6-encoding region is usually present. The ubiquitous 
p resenC e of the E7-encoding region in tumour cells 
5 suggests that the protein product of this gene might 
play a role in the induction or maintenance of the 
transformed phenotype. Indeed in most cell lines 
established from tumour biopsies, expression of the E7 
gene can be detected (Smotkin & Wettstein, PNAS, 83, 
10 4680, 1986). Furthermore, it has been shown that the 
E7 gene product can bind to the retinoblastoma (Rb) 
gene product, a recognised w anti-oncogene" in normal 
human cells (Munger et al., EMBO J. 8,4099, 1989). 
This strengthens the belief that E7 is directly 
15 involved in cell transformation. 

The presence and expression of the E7 and E6 
genes in tumour cells derived from cervical carcinoma 
biopsies, suggests the possibility that these proteins 
could be potential targets for the immunological 
20 recognition of the tumour cells. It is well known 
that viral proteins produced inside mammalian cells 
can be processed through a host cell pathway to short 
peptides, which then form a complex with host Major 
Histocompatibility Complex (MHC) Class 1 molecules and 
25 are transported to the cell surface. These complexes 
may then present a target for recognition by the host 
immune system. Interaction of the complex with the 
receptor molecule on the surface of cytotoxic T cells 
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(the T cell receptor) can then lead to activation of 
the T cells to proliferate or to destroy the 
recognised cell. It is possible, therefore, that the 
presence in the body of a population of cytotoxic T 
5 lymphocytes (CTLs) which are capable of recognising 
cells expressing the HPV E6 and/or E7 proteins could 
afford protection against the development and 
proliferation of cervical tumours. Indeed it has been 
reported that normally oncogenic mouse cells 
10 engineered to express the HPV E7 protein are unable to 
form tumours in mice which have been previously 
immunised with non-tumori genie E7-expressing cells, 
and that this rejection is mediated by CD8+ 
lymphocytes (CTLs) (Chen et al., PNAS 88, 110, 1991). 
15 Further, the generation of an active population of 
such cells subsequent to tumour initiation could 
result in regression of the tumour. 

There are numerous reports on the construction of 
recombinant viruses eg vaccinia viruses containing, 
20 and expressing foreign genes (Mackett 6 Smith, J. gen. 
Virol. 67,2067, 1986), and several reports of the use 
of these recombinant viruses to generate effective 
immune responses against the expressed foreign 
antigens. A particular advantage of this route for 
25 delivery of antigens for vaccination is that it may 
lead to the development of cellular as well as humoral 
immunity. This is because the foreign proteins will be 
produced inside cells of the infected individual in a 
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manner similar to that which occurs during natural 
infection. This means that they should be processed 
through the correct pathway to allow generation of a 
CTL response- In several cases, it has been 
5 demonstrated directly that immunisation with the 
recombinant virus is capable of producing a cellular 
immune response in the form of foreign 
antigen- specific CTLs (Moss & Flexner, Ann. Rev. 
Immunol., 5,305, 1987) • Furthermore, vaccination of 
10 animals with recombinant vaccinia viruses expressing 
certain tumour- specific antigens, such as the human 
melanoma-associated antigen P97 (Estin et alo, PNAS, 
85, 1052, 1988), the bovine papillomavirus E7 protein 
(Meneguzzi et alo. Vaccine, 8, 199, 1990) and the 
15 human breast cancer-associated antigen ETA (Hareuveni 
et alo, PNAS, 87, 9498, 1990) has been demonstrated to 
result in the induction of immunity against tumour 
initiation and progression. 

The present applicants have recognised the 
20 desirability of producing a recombinant virus vector 
which is useful as an immunotherapeutic or vaccine for 
conditions caused by HPV infection, for example for 
cervical cancer. With respect to cervical cancer, the 
art at the time of the applicants making the present 
25 invention recognised the E7 gene as having the 
potential to immortalise cells. Therefore, it would 
be felt inappropriate to incorporate the E7 gene in an 
immunotherapeutic . The applicants however , have 
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recognized the surprising usefulness of including the 
E7 gene in an immunotherapeutic . They have also 
recognized that the beneficial effects to be gained by 
treatment with an immunotherapeutic comprising the E7 
5 gene are likely to outweigh by far any risk associated 
» with the oncogenic activity of the E7 gene. Thus, an 
aspect of the applicants invention involves the use of 
a recombinant virus vector which expresses an E7 gene, 
as an immunotherapeutic or vaccine. Furthermore, the 

10 applicants provide embodiments of their invention in 
which these risks are reduced still further by 
j specific alteration of the gene sequences in order to 
reduce the oncogenic potential of the E7 gene without 
compromising its ability to stimulate an appropriate 

15 immune response. 

The present applicants have also recognized that 
where a number of HPV proteins which may be encoded by 
different HPV strains are implicated as being 
associated with a particular HPV-associated condition 

20 (for example, cervical carcinoma, HPV16 nd HPV18; 
genital warts, condyloma acuminata, respiratory 
papillomatosis, HPV6 and HPV11; squamous cell 
carcinoma in immuno suppressed individuals, HPV 5 and 
HPV8 ) , rather than produce a plurality of recombinant 

25 viruses engineered separately to express each of the 
implicated proteins, it would be advantageous to 
produce a single virus recombinant which is able to 
express part or all of the sequences of more than one 



BNSOOCID: <WO 92iee36A1_L> 



WO 92/16636 



PCT/GB92/00424 



6 

of the proteins o Thus, with respect to cervical 
cancer, rather than produce four recombinant viruses 
engineered separately to express each of the potential 
targets for immunological recognition of cervical 
tumour cells i.e. the HPV16 E6, HPV16 E7, HP VI 8 E6 and 
HP VI 8 E7 proteins, it would be especially advantageous 
to produce a single virus recombinant which is able to 
express the part or all of the sequences of more than 
one of the proteins, preferably at least two of the 



10 proteins and most preferably all four proteins. That 
/the present applicants are able to achieve this is 
I particularly surprising . This is because, the coding 
sequences for many HPV proteins are highly homologous 
to other equivalent HPV proteins (for example from 

15 other virus strains). Thus, the HPV16 E6 and HPV18 E6 
proteins show overall homology of 62% and comprise 
regions of very high homology. The same is true for 
HP VI 6 E7 and HP VI 8 E7 which show overall homology of 
57%, with particular regions of very high homology. 

20 This means that one would expect recombination to 
create problems such as loss of gene sequences. The 
applicants have, however, devised a novel strategy 
designed to minimise the likelihood of such 
recombination events and to circumvent the deleterious 
25 effect of those events should they indeed arise. 
Thus, surprisingly, the invention provides recombinant 
virus vectors which comprise at least one pair of 
nucleotide sequences which have sufficient sequence 
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homology that recombination between them might be 
expected. The at least one pair of nucleotide 
sequences may encode part or all of human 
papillomavirus (HPV) wild-type proteins or mutant 
5 proteins immunologically cross -reactive therewith. In 
particular, the invention provides a recombinant 
vector which can maintain stably, and express, part or 
all of four of the desired gene sequences from HPV 16 
and HPV18 . 

10 Thus, the present invention provides a 

recombinant virus vector for use as an 
immunotherapeutic or vaccine which comprises at least 
one pair of nucleotide sequences heterologous to said 
virus and which have sufficient sequence homology that 

15 recombination between them might be expected wherein 
said pair of nucleotide sequences are arranged in said 
virus vector such that they are inverted with respect 
to each other to reduce the likelihood of 
recombination events leading to loss of part or all of 

20 said sequence and said virus vector is able to infect 
a mammalian host cell and express as polypeptide the 
heterologous nucleotide sequences in said host cell. 
The at least one pair of nucleotide sequences may 
encode part or all of human papillomavirus (HPV) wild* 

25 type proteins or mutant proteins immunologically 
cross-reactive therewith. The pair of nucleotide 
sequences may encode parr or all of the protein E7 
from both HPV 16 and HPV18 or functional equivalents 
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thereof o The pair of nucleotide sequences may encode 
part or all of the proteins E6 from both HPV16 and 
HP VI 8 or functional equivalents thereof. 

The recombinant virus vector may comprise a 
5 further pair of nucleotide sequences heterologous to 
said virus and which (i) have sufficient sequence 
homology that recombination between them might be 
expected wherein said further pair of nucleotide 
sequences are arranged in said virus vector such that 
10 they are inverted with respect to each other and said 
virus vector is able to infect a mammalian host cell 
and express as polypeptide the further pair of 
heterologous nucleotide sequences in said host cell. 
The further pair of nucleotide sequences may 
15 encode part or all of HPV wild-type proteins or mutant 
proteins immunologically cross -reactive therewith „ 

For example, the present invention also provides 
a recombinant virus vector which in addition to the E7 
coding .sequences, also comprises and is adapted to 
20 express genetic sequences encoding part or all of the 
protein E6 from both HPV16 and HPV18 or functional 
equivalents thereof. The genetic sequences may 
comprise sequences encoding HPV16 E6/E7 and HPV18 
E6/E7 as shown in figures 1(a) and 1(b) 
2 5 respectively . 

The genetic sequences may encode an antigenic 
moiety of the said proteins. 

Either or both of the nucleotide sequences in a 
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pair of nucleotide sequences may be altered to make 
them less homologous than an equivalent pair of 
nucleotide sequences encoding wild-type HPV proteins. 
The alteration in nucleotide sequence may be in an 
5 area of high sequence homology. Preferably, the 
alteration in nucleotide sequence will not result in 
an alteration of the encoded amino acid sequence. 

Two or more nucleotide sequences each encoding 
separate proteins may be fused together to form a 

10 single open reading frame. Thus the genetic sequences 
encoding part or all of the proteins E6 and E7 from 
HPV16 , may be fused together to form a single open 
reading frame. The genetic sequences encoding part or 
all of the proteins E6 and E7 from HPV18, may be fused 

15 together to form a single open reading frame. The 
genetic sequences encoding part or all of the proteins 
E6 and E7 from both HPV16 and HPV18, may be fused 
together to form a single open reading frame. Thus, 
the recombinant virus vector may have the pairs of 

20 nucleotide sequences arranged according to any one of 
the options shown in Figure 26. Where the recombinant 
virus vector comprises an open reading frame having a 
fused genetic sequence encoding part or all of the 
proteins E6 and E7 from HPV16 , and a separate open 

25 reading frame having fused genetic sequences encoding 
part or all of the proteins E6 and E7 from HPV18, the 
two open reading frames may be inverted with respect 
to one another. For example, the two open reading 
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frames may be arranged in the recombinant: virus vector 
adjacent to each other. The inversion may be such 
that the E6 coding sequences of HPV16 and HP VI 8 are 
both located between the E7 coding sequences of HPV16 
5 and HPV18 • Alternatively, the inversion could be such 
that the E7 coding sequences of HPV16 and HPV18 are 
both located between the E6 coding sequences of HPV16 
and HPV18 . In particular the two open reading frames, 
each with its respective promoter, may be arranged 
10 next to each other in the recombinant vector . In this 
case the promoters may be located between the genes, 
which are transcribed outwardly, or the promoters may 
be located outside the genes, which are transcribed 
inwardly- 

15 Similarly, the genetic sequences encoding part or 

all of the E7 protein from HP VI 6 and the E7 protein 
from HPV18 may be fused together to form a single open 
reading frame. The genetic sequences encoding part or 
all of the E6 protein from HPV16 and the E6 protein 

20 from HP VI 8 may be fused together to form a single open 
reading frame. This leads to another range of 
arrangements similar to those shown in Figure 26. The 
fusions may be via a single codon encoding a 
relatively small neutral amino acid e.g. glycine. 

25 Thus the present invention also provides a 

recombinant virus vector which comprises a first open 
reading frame having a fused genetic sequence encoding 
part or all of the wild-type proteins E6 and E7 from 
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HPV16; and a separate second open reading frame having 
a fused genetic sequence encoding part or all of the 
wild-type proteins E6 and E7 from HPV18; wherein the 
first and second open reading frames may be inverted 
5 with respect to one another whereby either: i) the E6 
coding sequences of HPV16 and HPV18 are both located 
between the E7 coding sequences of HPV16 and HPV18; or 
ii) the E7 coding sequences of HPV16 and HPV18 are 
both located between the E6 coding sequences of HPV16 
10 and HPV18; and wherein any of said wild- type proteins 
may be replaced by a mutant protein immunologically 
cross -reactive therewith. 

Each of the first and second open reading frames 
may have a corresponding promoter and the two open 
15 reading frames each with its promoter, are arranged 
next to each other in the virus. 

The present invention also provides a recombinant 
virus vector wherein either: i) the promoters are 
located between the first and second reading frames 
20 whereby the open reading frames are transcribed 
outwardly; or ii ) the promoters are located outside 
the first and second open reading frames whereby the 
open reading frames are transcribed inwardly. 

The present invention also provides a recombinant 
25 virus vector which comprises a first open reading 
frame having a fused genetic sequence encoding part or 
all of the wild-type proteins E6 and E7 from HPV16; 
and a separate second open reading frame having a 
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fused genetic sequence encoding part: or all of the 
wild-type proteins E6 and E7 from HPV18; wherein the 
E6 coding sequences of HPV16 and HP VI 8 are both 
located between the E7 coding sequences of HPV16 and 
5 HP VI 8; and each open reading frame has a corresponding 
promoter , the promoters being located between the 
first and second open reading frames whereby the open 
reading frames are transcribed outwardly; and wherein 
any of said wild-type proteins may be replaced by a 
10 mutant protein immunologically cross-reactive 
therewith. 

The wild- type proteins HPV16E7 and HPV18E7 may be 
replaced with mutant proteins which are substantially 
homologous to said wild- type proteins and in which the 
15 residues cys 24 and glu 26 of wild-type protein 
HPV16E7 and the residues cys 27 and glu 29 of wild- 
type protein HPV18E7 are replaced with glycine 
residues . 

The recombinant virus vector may be derivable 
20 from vaccinia virus. 

The applicants have also recognized that for 
effective function as an immunotherapeutic, it is 
desirable for the recombinant virus to retain its 
ability to replicate and thereby generate an active 
25 infection in order that a cellular immune response may 
be mounted against the virus-encoded proteins. Thus, 
the applicants propose that the foreign gene sequences 
should be inserted into the vector virus at sites, the 
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disruption of which by the insertion of the 
heterologous gene sequences will not substantially 
interfere with, and therefore have a substantially 
adverse affect on any viral functions which relate to 
5 the replicative ability of the virus in the infected 
host animal* The applicants have named these sites 
1 neutral sites 1 ( although the term • neutral ' should 
not be interpreted strictly as it is acknowledged that 
the disruption of these sites may have a small, but 
10 relatively speaking inconsequential adverse affect on 
replicative ability). 

DNA sequences which affect virus replication can 
fall into several categories: 
i ) protein coding sequences ; 

15 ii) elements involved in control of gene expression; 
and iii) elements involved in virus DNA replication 
A non-essential and neutral insertion site must 
therefore avoid, such regions, and, such sites have 
been identified on 
20 the basis of nucleotide sequencing studies. Thus the 
genetic sequences may be inserted into neutral sites 
within the virus genome. One or more genetic 
sequences may be inserted into the same neutral site. 

Neutral sites can be easily tested for according 
25 to techniques well known in the art. For example, a 
site may be selected, interrupted or deleted using 
standard methodologies and the resultant recombinant 
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virus placed in conditions which normally support 
growth of the wild type virus vector, to assess the 
effect of the manipulations. The pathogenicity of the 
virus may be further compared with that of the 
5 unmodified virus vector strain in animal models, in 
order to assess its level of attenuation. 

In the present invention, the virus vector may be 
vaccinia virus. The vaccinia virus may be attenuated 
or disabled so that it is unable to fully replicate 
10 and esablish an extensive infection of host cells. 

Vaccinia virus has been used extensively in the 
past for vaccination against smallpox, and its use 
worldwide has led to the complete eradication of the 
disease (Bhebehami, Microbiol. Rev., 47, 455, 1983). 
15 During the World Health Organisation (WHO) campaign to 
eradicate smallpox, several different strains of 
vaccinia virus were used as vaccines. In 1984 a 
meeting was sponsored by the WHO to discuss the use of 
vaccinia virus as live virus vectors (Bulletin of the 
20 WHO 63(3):471-477). The data in this report indicates 
that the number of complications associated with 
vaccination was lowest for the Wyeth strain of 
vaccinia virus, and so this strain has been chosen as 
a basis for the construction of the recombinant virus 
25 according to an embodiment of the present invention. 

It is well known that insertion of foreign DNA 
into the genome of vaccinia virus at certain favoured 
sites, such as the thymidine kinase gene locus, can 



WO 92/16636 



PCI7GB92/00424 



15 

reduce dramatically the ability of the virus to 
replicate in vivo. As discussed above, the aim of the 
therapeutic approach described here is to generate an 
active in vivo infection, so that a cellular immune 
5 response may be mounted against the virus encoded 
proteins. The present invention provides a method for 
inserting foreign genes at neutral sites within the 
genome of a virus, the disruption of which sites by 
the insertion will not interfere with and therefore 
10 have a substantially adverse affect on virus 
replication. 

Where the virus is vaccinia virus, the neutral 
site may be identified herein within the Wyeth strain 
of vaccinia virus on the basis of the related WR 

15 strain nucleotide sequence. Alternatively, where 
other vaccinia virus strains are used, sites 
equivalent to those sites identified above may be 
used. The neutral sites may be any as identified 
hereinafter as A,B,C and D, or a functional 

20 equivalent. 

For successful expression of foreign proteins by 
the recombinant virus vector the foreign genes must be 
placed under the control of a promoter sequence which 
is operable by the virus. Thus the recombinant virus 

25 vector may comprise a single promoter which controls 
the expression of all the heterologous genetic 
sequences within a single open reading frame. 
Alternatively, where the recombinant virus vector 
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encodes more than one open reading frame containing 
heterologous genetic sequences, the virus may comprise 
a first promoter which controls the expression of the 
genetic sequences from a first open reading frame , and 
5 one or more further promoters which control the 
expression of the genetic sequences from one or more 
further open reading frames «, The promoter sequence 
may be virus- specific and several have been 
characterised so far (Davison & Moss, J. Mol. Biol., 
10 210, 749, 1989; Davison & Moss, J. Mol. Biol., 210, 
771, 1989). The single promoter and the first and one 
or more further promoters may be the p 7.5 promoter. 
There have been reports that the induction of foreign 
antigen-specific CTLs requires expression of the 
15 antigen early in the virus replication cycle (Coupar 
etal., Eur. J. Immunol., 16, 1479, 1986). Therefore, 
a recombinant virus as provided by the present 
invention may involve the use of the p7 . 5 promoter 
(Venkatesan et al., Cell, 125, 805, 1981) and/or the 
20 H6 promoter (Rosel et al., J. Virol, 60, 436, 1988), 
both of which are active both early and late in 
infection. 

As mentioned earlier, it has been reported that 
the E7 gene on its own has the potential to 
25 immortalise cells (Phelps et al., Cell 53, 539, 1988). 
In an embodiment of the present invention, the 
strategy for expression of the protein involves 
production of E7 as a fusion protein with E6, which is 
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unlikely to retain biological function. Embodiments of 
the invention provide for reducing this risk still 
further, by making changes within the E7 gene which 
are known to destroy its oncogenic capacity ( Chesters 
5 et al., J. Gen Virol. 71, 449. 1990). Thus in the 
recombinant virus vectors of the present invention, 
the genetic sequences encoding part or all of the E7 
proteins may be altered from the equivalent wild type 
sequences, in order to render the sequences, used in 
10 the recombinant virus vectors less oncogenic than 
their equivalent wild type sequences. 

The present invention also provides 
pharmaceuticals comprising recombinant virus vectors 
as herein defined. The pharmaceutical may be for use 
15 against a condition caused by HPV infection which 
comprises an immunotherapeutically effective amount of 
a recombinant virus vector. The pharmaceutical may be 
for use against cervical cancer. 

The pharmaceutical may be a vaccine to immunise 
20 against a condition caused by HPV infection which 
comprises an amount of recombinant virus vector as 
herein provided which when administered to a recipient 
can specifically activate cells of the immune system 
to HPV proteins. The vaccine may be for immunisation 
25 against cervical cancer. 

The pharmaceuticals may comprise one or more 
excipients. The present invention also provides 
methods of using the recombinant . virus vectors as 
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herein defined "to make medicaments for use as 
imraunotherapeutics or vaccines against conditions 
thought to be caused by HPV infection. For example 
for the prophylaxis and treatment of cervical cancer. 
5 The present invention also provides methods of 

treating mammalian patients with recombinant virus 
vectors and pharmaceuticals as herein provided. 

The present invention also provides a method of 
determining a neutral site in a virus vector, the 
10 disruption of which by the insertion of heterologous 
gene sequences will not interfere with, and therefore, 
have a substantially adverse affect on viral function 
which relates to the replicative ability of the virus. 
The method for this determination comprises s (a) 
15 analysing a viral genome to identify open reading 
frames which are likely to encode functional genes, 
by looking for expected codon usage between spaced 
apart start and stop codons; and (b) selecting sites 
which are not in such open reading frames, likely to 
20 encode functional genes, as identified in (a) This may 
include selecting sites between open reading frames 
for sequences of functional genes and selecting sites 
which are in open reading frames which have some 
functional gene characteristics, such as an expected 
25 codon usage, but have lost other essential 
characteristics such as a start codon. The method may 
also comprise interrupting or deleting the selected 
sites from the viral genome and placing the resultant 
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virus in conditions which normally support: growth of 
the wild type virus. 

The present invention also provides neutral sites 
identified by use of the above methods. 
5 The present invention provides an embodiment 

which shows a way of inducing a cellular immune 
response against the papillomavirus proteins usually 
expressed in cervical tumour cells by the creation of 
a recombinant vaccinia virus, which has been 

10 engineered to produce the HPV E6 and E7 proteins, or 
proteins containing HPV E6 and E7 sequences, during 
its replication cycle. This therapeutic vaccinia 
virus contains the E6 and E7 genes from both HPV16 and 
HPV18, the viruses most commonly associated with 

15 cervical carcinoma. Vaccination with this single 
virus may thus stimulate immunity to the E6 and E7 
proteins of the HPV types associated with more than 
80% of cervical tumours. Expression of all four gene 
sequences (e.g. HPV16 E6 and E7; HPV18 E6 and E7 ) in 

20 a single virus however presents a problem, because of 
the likelihood of loss of genetic sequences through 
recombination. The present invention provides a 
method for circumventing this difficulty, firstly 
through specific sequence alteration, in order to 

25 reduce sequence homology and secondly through their 
insertion into the vaccinia virus genome in such a way 
that if such recombination were to occur, it would not 
lead to loss of sequences (i.e. in inverted 
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orientation with respect to each other ) . Expression 
of the desired four gene sequences in the vaccinia 
virus genome could also be difficult (though not 
impossible) to achieve as independent expression 
5 units, and so the invention provides that instead, the 
E6 and E7 open reading frames may be fused together. 
A problem with standard methods for insertion of 
foreign information into the vaccinia virus genome is 
that the use of selectable markers to increase the 
10 efficiency of recombination results in the ultimate 
presence in the recombinant virus also of the 
selectable marker gene itself. Methods for insertion 
have been developed however, which allow subsequent 
elimination of these extraneous sequences (Falkner & 
15 Moss J. Virol- , 64, 3108, 1990) and these are used in 
an embodiment of the present invention to ensure that 
the final recombinant vaccinia virus has only those 
additional sequences which are necessary for its 
required function. 
20 In order that the present invention is more fully 

understood an embodiment will now be described in more 
detail with reference to the figures in which: 

Figure 1(a) shows the nucleotide sequence and 
three-frame translation of HP VI 6 E6/E7 polymerase 
25 chain reaction product (underlined regions indicate 
the E6 and E7 coding sequences); Figure 1(b) shows the 
nucleotide sequence and three- frame translation of 
HP VI 8 E6/E7 polymerase chain reaction product 
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(underlined regions indicate the E6 and E7 coding 
sequences ) ; 

Figure 2 shows the cloning and modification of the 
HPV16 and HPV18 E6 and E7 genes; 
5 Figure 3 shows an open reading frame plot of vaccinia 
virus from positions 17201-18450 of the region covered 
by the four fragments SalF,G,H and I; short vertical 
lines denote termination codons, lines topped with 
boxes denote initiation codons, rectangles show 

10 relevant open reading frames, and arrows show 
direction of upper and lower DNA strands; 
Figure 4 shows an open reading frame plot of vaccinia 
virus from positions 21001-22000 of the region covered 
by the four fragments SalF,G,H and I; short vertical 

15 lines denote termination codons, lines topped with 
boxes denote initiation codons, rectangles show 
relevant open reading frames and arrows show direction 
of upper and lower DNA strands; 

Figure 5 shows an open reading frame plot of vaccinia 
20 virus from positions 23501-25000 of the region covered 
by the four fragments SalF,G,H and I; short vertical 
lines denote termination codons, lines topped with 
boxes denote initiations codons , rectangles show 
relevant open reading frames and arrows show the 
25 direction of upper and lower strands of DNA; 

Figure 6 shows a codon usage plot of vaccinia virus 
from positions 17201-18450 of the region covered by 
the four fragments SalF,G,H and I; arrows show 
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Figure 7 shows a codon usage plot of vaccinia virus 
from positions 21001-22000 of the region covered by 
the four fragments SalF, G, H and I; arrows show 
5 direction of each DNA strand; 

Figure 8 shows a codon usage plot of vaccinia virus 
from positions 23501-25000 of the region covered by 
the four fragments SalF,G,H and I; arrows show 
direction of each DNA strand; 
10 Figure 9 shows the DNA sequence around site A showing 
translations in single letter amino acid code of genes 
SalF 17R and SalF 19R; 

Figure 10 shows the DNA sequence around site 8 showing 
translations in single letter amino acid code of genes 
15 SalF 20R and SalF20.5R; 

Figure 11 shows a comparison of the SalG2R open 
reading frame to the yeast guanylate kinase gene 
sequence; 

Figure 12 shows the DNA sequence around site D showing 
20 translations in single letter amino acid code of genes 
HindB3R and Hind B4R; 

Figure 13 shows the cloning of vaccinia virus (Wyeth 
strain ) neutral sites ; 

Figure 14 shows the cloning of vaccinia virus promoter 
2 5 sequences ; 

Figure 15 shows the construction of vaccinia 
promoter-driven E6-7 cassette; 

Figure 16 shows the cloning of the E6-7 cassette into 
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vaccinia virus (Wyeth strain) neutral sites; 
Figure 17 is a diagram showing the recombination 
required to generate the final therapeutic vaccinia 
virus - HPV recombinant virus; 
5 Figure 18 shows the synthetic oligonucleotides used in 
the construction of the therapeutic vaccinia virus HPV 
recombinant ; 

Figure 19 shows the nucleotide sequence of vaccinia 
virus (WR strain) from positions 17201-18450 of the 
10 region covered by the four fragments SalF,G,H and I; 
Figure 20 shows the nucleotide sequence of vaccinia 
virus (WR strain) from positions 21001-22000 of the 
region covered by the four fragments SalF,G,H and I; 
and 

15 Figure 21 shows the nucleotide sequence of vaccinia 
virus (WR strain) from positions 23501-25000 of the 
region covered by the four fragments SalF,G,H and I. 
Figure 26 shows, a variety of options for arrangment of 
HPV16E6 and E7 and HPV18E6 and E7 coding sequences in 

20 a recombinant virus vector. 

All cloning procedures are carried out according 
to the protocols described in "Molecular Cloning", A 
Laboratory Manual , eds. Sambrook, Fritsch and 
Maniatis, Cold Spring Harbor Laboratory Press, 1989. 

25 All plasmids on which site directed mutagenesis is 
performed are of the "phagemid" type, which may be 
converted to single- stranded DNA by superinfection 
with the bacteriophage fl. Preparation and site 



BNSOOCtD: <WO 921663CA1JU> 



WO 92/16636 



PCT/GB92/00424 



24 

directed mutagenesis of single-stranded DNA, is 
carried out as described by Brierley et al, f Cell, 57, 
537, 1989. The sequence of all the synthetic 
oligonucleotides used are provided in figure 18. 

5 Preparation of the E6 and E7 genes from HPV16 and 
HP VI 8 for insertion into vaccinia virus 
Cloning of the HP VI 6 and HPV18 E6 and E7 
A fragment of DNA containing the HPV16 E6/7 coding 
region is prepared by polymerase chain reaction (PCR) 
10 amplification from the plasmid pBR322/HPV16 (Durst et 
al., PNAS, 80, 3812, 1983) using the oligonucleotides 
SOS and S05. A fragment containing the same region 
from HPV18 is prepared by the same procedure from 
plasmid pBR322/HPV18 (Boshart et al., EMBO J. 3,1151) 
15 using the oligonucleotides SOI and S02. Plasmids 
PBR322/HPV16 and pBR322/HPV18 are both available from 
Behringwerke AG; P.O. Box 1140 D-3550, Marburg, 
Germany (alternatively the necessary sequences can be 
created synthetically from the sequence information 
20 provided by the present application). 

In each case, this produces a DNA fragment of 
about 800 base pairs (bp) with a site for the 
restriction enzyme Nco 1 (CCATGG) located exactly at 
the beginning of the E6 gene, and a Smal site 
25 immediately downstream of the termination codon for 
the E7 gene ( figure 1(a) and ( b ) ) . The products are 
then digested with Ncol and Smal, and cloned into 
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Ncol-Smal digested plasmid pUC118NS (a modified 
version of the "phagemid" pUC118 (Viera & Messing, 
Methods Enzymol., 153,3, 1987) in which Ncol and Smal 
sites have been created by site-directed mutagenesis 
5 within the poly-linker region) to generate the plasraid 
plMS7, containing the HPV16 sequences, and pIMS8 
containing the HPV18 sequences (figure 2). The use of 
pUC118 is not crucial to the present strategy as any 
plasmid which can be manipulated by site directed 
10 mutagenesis can be successfully used. 

Fusion of the E6 and E7 ORFs 

For insertion into vaccinia virus, the E6 and E7 genes 
from each HPV type, are first fused together to form 
a single continuous ORF. This is achieved by 
15 site-directed mutagenesis as follows: 

(i) The termination codon TAA of HPV16 E6 in pIMS7 is 
altered using the oligonucleotide S20 to the sequence 
GGAA. This is in order to convert the normally 
separate ORFs for HPV16 E6 and E7 into a single ORF 

20 (pIMS7.1 - figure 2). 

(ii) The termination codon TAA of HPV18 E6 in pIMS8 is 
altered using the oligonucleotide S21 to the sequence 
GGAA. This is in order to convert the normally 
separate ORFs for HPV18 E6 and E7 into a single ORF 

25 (pIMS8.1 - figure 2). 

Abolition of the immortalising potential of E7 
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In order to destroy the immortalising properties of 
each of the E7 proteins, two key codons within the 
HPV16 E7 coding sequence, (cys24 and glu26 - figure 
1(a)) and the equivalent codons from HPV18 E7 (cys27 
5 and glu29 - figure 1(b)), are altered to glycine 
residues by site directed mutagenesis as follows • 

(i) The sequence of the E7 gene is altered in pIMS7 
to encode glycine at codons 24 and 26 (normally 
encoding cysteine and glutamate respectively, using 

10 oligonucleotide S22 (pIMS7„2 - figure 2). 

( ii ) The sequence of the E7 gene is altered in pIMS8 
to encode glycine at codons 27 and 29 (normally 
encoding cysteine and glutamate respectively, using 
oligonucleotide S23 (pIMS8olB - figure 2). 

15 Reduction in intertypic recombination potential of 
HP VI 6 and HPV18 E6 and E7 sequence and elimination of 
potential vaccinia virus transcription termination 
signal 

A potential difficulty with the presence of both HP VI 6 
20 and HP VI 8 E6 and E7 specific DNA within the genome of 
a single virus, is that recombination between the two 
sets of related sequences could lead to loss or 
rearrangement of information such that expression of 
the required proteins is disrupted. The invention 
25 provides ways of minimising this risk. Firstly, by 
inserting the two sets of genetic information in the 
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vaccinia genome in opposite orientation to each other 
( so that recombination will result not in the loss of 
sequence information, but rather in its inversion). 
Secondly, by creating specific changes in the E6/7 
5 sequence of one of the HPV virus strains at sites 
where the homology is greatest. These changes however 
are made in such a way that the amino acid coding 
potential of the genes remains unaltered. 

The HPV18 E6 sequences is therefore altered by 
10 site-directed mutagenesis as follows: 

The sequence TTTTTATTCTAGAATTAGAG (which begins 
210 nucleotides from the start of E6 - underlined 
in figure Kb)) is mutated, using oligonucleotide 
S24 to the sequence TTTCTACAGTAGAATCAGAG (pIMS8.2 
15 - figure 2) (changed nucleotides are in bold 

type ) . 

A second aim of this change is to eliminate from 
the HPV18 E6 sequence, the sequence TTTTTAT, which is 
a potential termination signal for the early vaccinia 
20 virus transcription enzyme (Rohrmann et al. , Cell. , , 46, 
1029, 1986). 

Source and propagation of vaccinia virus 
The Wyeth strain of vaccinia virus is used for 
construction of the therapeutic virus. It is propagated 
25 in Vero cells for the purposes of genetic manipulation, 
and in the human diploid fibroblast cell line MRC5 for 
the production of the final therapeutic virus stock. 
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Both cell lines are obtained from the National Institute 
of Biological Standards and Control , South Minis, U.K.. 
The Wyeth strain of vaccinia virus, vero cells and the 
cell line MRC5 are also available from the American Type 
5 Culture Collection, 12301 Parklawn Drive, Rockville, 
Maryland 20852, USA. 

Identification and cloning of neutral sites from the 
vaccinia virus 
Description of neutral sites 
10 For the purpose of insertion of papillomavirus genes 
within the vaccinia virus genome, sites have been chosen 
to have two characteristics. 

Firstly, they should be non-essential regions, i.e. 
insertion of foreign genes at these points, will not 
15 disrupt any functions of the vaccinia virus to the 
extent that the virus can no longer grow in tissue 
culture . 

Secondly, they should be neutral sites, i.e. 
insertion of foreign genes all these points, will not 

20 increase or decrease the level of attenuation of the 
vaccinia virus. 

The difference between these two factors can be 
seen by looking at the thymidine kinase (TK) gene of 
vaccinia virus. It is a non-essential region, and hence 

25 viruses with genes inserted in the TK gene can grow well 
in tissue culture (Mackett at al., J. Virol., 49, 857, 
1984). However, such viruses have been found to be 
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greatly attenuated in vivo (Buller et al., Nature 317, 
813, 1985). For the purpose of prophylactic 

vaccination, such increased attenuation might be 
desirable. However, for an immunotherapeutic strategy 
5 where the danger from the disease to be treated clearly 
outweighs the risk of vaccine associated complications, 
use of an attenuated virus is considered undesirable 
since it could compromise the immunological response to 
the papillomavirus antigens. Hence, the applicants have 
10 identified sites which they judge will not attenuate the 
virus any further, and have termed them 'neutral sites'. 
Such sites have been identified within the virus genome 
by careful analysis of the DNA sequence of the WR 
strain. The WR strain was originally derived from the 
15 Wyeth strain by passage in mouse brain. Therefore the 
two strains are closely related. The nucleotide 
sequence of three regions of the WR genome which contain 
the selected neutral sites are shown in figures 19,20 
and 21. Four neutral sites (A-D) have been chosen on 
20 the above discussed criteria as follows: 
Site A: gap between SalF17R and SalF19R 
Site B: gap between SalF19R and SalF20.5R 
Site C: in SalG2R, a potential non- functional gene 
Site D: in HindB3.5R, a potential non- functional gene 
25 These sites (A-D) can be identified by the following 
stretches of DNA sequence, each of which is 40 
nucleotides in length. 

A CTATCTACCAGATTATTATGTGTTATAAGGTACTTTTTCT 
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B TATTGTGCTACTGATTCTTCACAGACTGAAGATTGTTGAA 
C TCTCTTAAAATGGTTGAGACCAAGCTTCGTTGTAGAAACA 
D TGAGGCTACCTCGACATACGTGTGCGCTATCAAAGTGGAA 
In other strains these sequences may vary, while still 
5 having substantial homology with those given above . In 
particular a site may have at least 90%, more preferably 
95%, homology with the sequence given above. 
Figures 3-5 show the distribution of initiation codons 
and open reading frames ( ORFs ) in the regions of the 
10 vaccinia virus genome shown in figures 19,20 and 21. 
Figures 6-8 show the same regions with a plot showing 
to what extent each reading frame conforms to the 
pattern of codon usage expected for vaccinia genes. A 
graph of codon usage is plotted for each of the three 
15 possible reading frames in each direction (Staden, R. , 
Nucl. Acids Res., 12, 521, 1984; Staden, R. , Nucl. Acids 
Res., 12, 551, 1984). In these codon usuage plots, the 
short vertical bar lines extending from the horizontal 
axes represent start codons. The longer vertical bar 
20 lines placed above the horizontal axes represent stop 
codons. This sort of plot is a useful way of helping to 
determine whether a particular ORF is a genuine vaccinia 
gene. Where there is a likely genuine gene, the graph 
of codon usuage will rise between a start codon and a 
25 stop codon. For example, in figure 7, it can be seen 
that the graph of codon usage rises over the region of 
the SalG2R ORF (the dotted line shows that this frame 
conforms most of the expected codon -usage ) . For the 
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other two frames the graphs show that they do not 
conform to vaccinia codon usage. The peak labelled 
'part of gk', and marked with a dashed line, also 
conforms well to vaccinia codon usage. In summary, a 
5 genuine gene must start with an initiation ( start ) 
codon, end with a termination (stop) codon, and should 
conform well to vaccinia codon usage along its length. 
In most cases the conformation to the vaccinia codon 
usage drops off sharply outside the gene. 
10 The neutral sites are further described as follows 
Site A. Gap between SalF17R and SalF19R 

Site A is marked on figures 3 and 6 . Figure 9 shows the 
actual DNA sequence with a translation of the ORFs on 
either side of the site. It can be seen that Site A is 

15 placed in an intergenic region between SalF17R and 
SalF19R. It is placed some 195 bases upstream of 
SalF19R to avoid any promoter elements associated with 
that gene. The sequence TTTTTCT (shown in italics) will 
act as a terminator of early RNA transcription for the 

20 SalF17R gene if it is an early gene. However, the site 
is placed downstream of the first of these, so it will 
not affect early termination of transcription if it 
occurs. Examination of figure 6 shows that there is no 
recognisable gene on the opposite strand at this point, 

25 and hence this sequence location is suitable as a 
neutral insertion site. 

Site B. Gap between SalF19R and SalF20.5R 

Site B is marked on figures 3 and 6 . Figure 10 shows 
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the actual DNA sequence with a translation of the ORFs 
on either side of the site. It can be seen that Site 
B is placed in an intergenic region between SalF19R and 
SalF20.5Ro Figure 6 shows that it is within a region 
5 of high vaccinia codon usage, but that this region does 
not form a genuine gene, having no initiation codon. 
In addition, figure 6 suggests that Sal20.5R is not a 
complete gene, as the conformation to vaccinia codon 
usage drops off dramatically at the start of the gene. 
10 In the event that SalF20.5R is a genuine gene, Site B 
is placed some 70 bases upstream of SalF20.5R which may 
well avoid any promoter elements associated with that 
gene. (Note: many vaccinia promoter elements are 
located in approximately 35 bases upstream of the start 
15 of the gene. ) In addition SalF20R has no TTTTTNT (N=any 
nucleotide) transcription termination signal with which 
Site B could interfere. Hence this sequence location 
is suitable as a neutral insertion site. 



Site Co Within SalG2R, a potential non- functional gene 
20 Site C is marked on figure 4. The ORF SalG2R has 
considerable similarity to the guanylate kinase ( GK) 
gene of yeast. This similarity is shown in figure 11. 
Sequence upstream of the SalG2R ORF (but in a different 
frame) has been added on to SalG2R, to see if the match 
25 to GK extends beyond the boundaries of the original open 
reading frame. The match appears to extend beyond the 
5 r end of the SalG2R ORF. In particular, an important 
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site in the yeast GK gene, the ATP/GTP binding site 
(shown underlined) only matches in the out of frame 
sequence upstream of the SalG2R ORF. Hence, it is very 
likely that the SalG2R gene is not active as a guanylate 
5 kinase and can be referred to as a 'pseudogene' • If the 
gene is inactive as the applicants deduce, then it will 
serve as a neutral insertion site. 

Site D. Within HindB3.5R, a potential non- functional 
gene 

10 Figure 5 shows that site D lies within the region 
designated HindB3.5R. This region, although conforming 
to vaccinia codon usage, has no start codon and is 
therefore not a genuine gene. The codon usage plot 
shown in figure 8 indicates that it probably was once 

15 a functional gene, and may well have been attached to 
HindB3R (a shift in the codon usage preference occurs 
here well away from the termination codon of the HindB3R 
ORF which suggests that the last section of HindB3R is 
not properly part of this gene. ) Hence it is likely 

20 that HindB3,5R is not active as a gene and can be used 
as a neutral insertion site. Figure 12 shows the actual 
DNA sequence with a translation of the ORFs on either 
side of the site. It can be seen that site D is placed 
in an intergenic region between HindB3R and HindB4R as 

25 well as being within the non- functional HindB3.5R. 

Preparation of vector for cloning of neutral sites 

In order to insert foreign genetic information into the 
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neutral sites described above, DNA copies of the neutral 
sites, together with an appropriate amount of flanking 
DNA from the vaccinia genome (approximately 500 bases 
on either side) must first be cloned into a plasmid 
5 vector o These plasmids may then be used to introduce the 
foreign DNA into the vaccinia virus genome; the vaccinia 
virus 1 flanking sequences * around the inserted gene 
serve to allow homologous recombination between the 
plasmid DNA and the viral DNA, with the consequent 
10 insertion of the foreign gene at the desired location. 

Cloning of neutral site sequences 

Plasmids containing flanking regions from the neutral 
sites are constructed as follows, DNA is prepared from 
the Wyeth strain of vaccinia virus by the method of 

15 Esposito et al., (Jo Virol, Meth. 2: 175, 1981), The 
polymerase chain reaction (PCR) is used to remove an 
approximately 1000 base pair (bp) fragment from DNA of 
the Wyeth strain of vaccinia virus . Pairs of 
oligonucleotides are chosen approximately 500 bp either 

20 side of the chosen neutral site. These oligonucleotides 
are based on the sequence of the WR strain, but are 
chosen in regions where the sequence of the WR strain 
is identical to that of the Copenhagen strain (Goebel 
et alo, Virology 179:247, 1990). The oligonucleotides 

25 incorporate restriction enzyme recognition sequences so 
that they can be cloned easily into a plasmid. For 
neutral sites A, B and D the restriction sites are EcoRl 



WO 92/16636 



PCT/GB92/00424 



35 

and Hindlll. For neutral site C the Hindlll site is 
replaced by an Sphl site, since there is an internal 
Hindlll site in the chosen flanking sequences. 
The oligonucleotides used for PGR are listed below: 
Site A leftMB 16 
Site A rightMB 17 
Site B leftMB 24 
Site B rightMB 25 
Site C leftMB 18 
Site C rightMB 19 
Site D leftMB 22 
Site D rightMB 23 

DNA fragments of approximately Ikb are then prepared 
using these pairs of oligonucleotides by PCR 
15 amplification, digested with EcoRI and Hindlll (for site 
A,B and D) or with EcoRI and Sphl (for site C) and 
cloned into Hindlll and EcoRI -digested pUC118 (figure 
13) to generate the plasmids pIMMC7a, pIMMC7b, pIMMC7c 
and pIMMC7d. 

20 Creation of unique restriction sites for insertion at 
the neutral sites 

A suitable restriction enzyme site is then introduced 
at the selected location within each of the plasmids. 
This is achieved using site directed mutagenesis using 
25 an oligonucleotide containing the desired new unique 
site and flanked by 15 bases of sequence to either side 
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(see below). The plasmids modified in -this fashion are 
designated pIMMC8a-d ( figure 13 ) . 

original oligonucleotidesite new plasmid 
plasmid introduced 



pIMMC7a MB35SnaBl pIMMC8a 

pIMMC7b MB36Hpal pIMMCb 

PIMMC7C MB37Stul pIMMC8c 

pIMMC7d MB38SnaBl pIMMC8d 



10 Cloning of the vaccinia virus early/late promoter 
sequences 

The p7.5 and H6 promoters from vaccinia virus genomic 
DNA are prepared by PCR amplification as described 
below. 

15 A pair of complementary oligonucleotides (S7 and S8) is 
synthesised to include the following restriction enzyme 
sites, Hindlll, SnaBl, Hpal, Hindlll, Sail, Ncol, Smal, 
SnaBl and EcoRl, such that the pair, after annealing, 
present at one end Hindlll compatible overhanging ends, 

20 and at the other, EcoRl compatible overhanging ends. 
The two oligonucleotides are allowed to anneal and are 
inserted into pUC118 cut with EcoRl and Hindlll (figure 
14). The resulting vector is called pIMMC3. 

A DNA molecule of approximately 180 bp containing 
25 the H6 promoter is removed from the WR strain of 
vaccinia virus by PCR amplification using the 
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oligonucleotides MB15 (anneals upstream and includes a 
5 ' -Sail site) and MB7 (anneals downstream and includes 
a 5' -Hindi I I site). This is cloned into pIMMC3 cleaved 
with Hindlll and Sail to create pIMMC4a (figure 14). 
5 A DNA molecule of approximately 200bp containing the 
p7.5 promoter is then removed from the WR strain of 
vaccinia virus by PCR amplification using the 
oligonucleotides MB32 (anneals upstream and includes a 
5 1 -Sail site) and MB33 (anneals downstream and includes 
10 a 5' Ncol site). This is cloned into pIMMC3 cleaved with 
Ncol and Sail to create pIMMC14b. 
Construction of the therapeutic virus 

The strategy required to generate a recombinant vaccinia 
virus containing and expressing the E6-E7 proteins from 
15 HPV16 and HPV18, based on the elements described above, 
involves five main stages as outlined below, 
i ) Cloning of the modified E6-7 genes downstream of 
vaccinia early promoter sequences 

A DNA fragment containing the modified HPV16 E6-7 
20 sequence is excised from pIMS7.2 by digestion with 

Hindi I I and Smal, and cloned into Hindi I I and 
Hpal-digested pIMMC4a to generate pIMS12 ( figure 
15). 

A DNA fragment containing the modified HPV18 E6-7 
25 sequence is excised from pIMS8.2 by digestion with 

Ncol and Smal, and cloned into Ncol and 
Smal-digested pIMMC14b to generate pIMS14 (figure 
15). . 
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ii ) Preparation of a plasmid vector containing both 
HP VI 6 and HP VI 8 E6-7 sequences together with their 
upstream vaccinia promoters . 

A DNA fragment containing the HP VI 8 E6-7 region 
together with the upstream p7 <, 5 promoter is 
excised from pIMS14 with Sail and Smal and 
inserted into Sail and Smal -digested pIMS12 to 
generate pIMSIS (figure 15) 

iii) Insertion of the HPV E6-7/promoter "double" 
cartridge into the neutral site containing 
plasmids - 

A DNA fragment containing both the HPV16 and HP VI 8 
E6-7 coding regions together with their upstream 
promoter elements is excised from pIMSIS with 
SnaBl and inserted into the appropriately-digested 
neutral site-containing plasmids pXMMC7a-d. This 
step is shown in figure 16, and the resulting 
plasmids are designated pIMMC9a-d. 
iv) Introduction via homologous recombination of the 
neutral site DNA, together with the intervening HPV 
sequences, into the vaccinia virus genome to create 
a recombinant virus expressing the two modified HPV 
E 6 - 7 sequences . 

The recombinant plasmids pIMMC9a-d are purified 
and allowed to recombine into vaccinia ( figure 17 ) 
using standard protocols (Mackett et al., in D.M. 
Glover (ed) DNA Cloning: a Practical Approach, 
Oxford and Washington DC, IRL Press, 1985). 
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Viruses which have acquired the HPV sequences are 
identified by probing with radiolabelled HPV 
specific sequences. Viral plaques are lifted onto 
nitrocellulose (Villareal and Berg, Science 196, 
183, 1977) and probed with radiolabelled Ncol-Smal 
fragment from pIMS14 containing the HPV18 E67 
gene* Recombinant viruses are then isolated from 
the agarose overlay and plaque purified three 
times. They are checked for the presence of the 
appropriate DNA sequences by Southern blotting of 
purified virus DNA using DNA probes derived from, 
the HPV E6 and E7 genes, and for expression of the 
appropriate sequences by western blotting using 
antisera specific for the HPV E6 and E7 proteins. 

15 Cloning of the therapeutic virus in MRC5 cells 

Stocks of the final recombinant virus are prepared 
by growth in Vero cells, and are used to infect 
MRC5 cells deemed suitable for the preparation of 
material suitable for use as human vaccines. The 

20 virus is plaque-purified three times by standard 

methods, and finally a stock prepared for clinical 
use. 

Confirmation of presence of the correct HPV DNA 
insert 

25 A sample of this stock virus is checked once again 

for the presence of correctly configured virus 
DNA, and for expression of the correct virus 
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proteins- Figure 22 shows -the analysis by FCR, of 
a recombinant: vaccinia virus (v9a.l) in which the 
HPV DNA cassette is inserted at Site A. The 
diagram shown in panel (a) indicates the DNA 
fragments expected if insertion of the correct DNA 
has occurred. It can be seen in panel (b) that 
the actual pattern of PCR products generated is 
consistent with that expected. 

Confirmation of expression of the HPV DNA insert 
The recombinant viruses are then checked for 
expression of the expected HPV proteins. An 
example of this analysis is shown in Figure 23 . 
Vero cells are infected with with recombinant 
virus v9a.l (HPV DNA inserted at Site A) , and the 
cells examined by western blotting for the 
presence of the HPV E67 fusions proteins using 
monoclonal antibodies specific for the HPV16 E7 
protein (camvir3) and for the HPV 18 E7 protein 
(7E10). It can be seen that both monoclonal 
antibodies recognise specifically proteins of the 
expected size in cells infected with the 
recombinant virus v9a.l, but not in cells infected 
with the control parent virus Wyeth strain. These 
recognised proteins co-migrate with proteins 
synthesised by in vitro translation of mRNA 
encoding the expected HPV fusion proteins (HPV 16 
E67 and HPV18 E67 ) . This experiment indicates 
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successful expression of the heterologous gene 
sequences from the recombinant virus. 
Stability of the HPV DNA insert 

For the recombinant virus to be of use clinically, 
it is important that the inserted sequences remain 
genetically stable over multiple virus passage, 
and the DNA insert was carefully designed to 
promote this genetic instability. To confirm the 
stability of the HPV information within the 
recombinant virus genome, the virus is subjected 
to 9 serial passages, (multiplicity of infection 
=10pfu/cell) in Vero cells. Subsequently 20 
plaque isolates are picked, and analysed for the 
presence of the correct HPV DNA insert by PCR 
analysis as described in Figure 22. The data 
obtained for recombinant virus A are shown in 
Figure 24. All 20 virus isolates retained the HPV 
information in the expected genetic arrangement 
indicating a considerable degree of genetic 
stability. 

Animal experiments 

The virulence of the recombinant virus is compared 
in animal experiments with that of the parental 
Wyeth strain. Groups of 20 mice are inoculated 
intranasally each with 10 7 pfu of Wyeth strain or 
recombinant virus in a total volume of 20pl . Two 
mice are sacrificed at 1 day, 3 days and 5days 
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following inoculation, and the lungs dissected 
out* The amount of virus present in the lungs is 
then measured by grinding the tissue, and assay of 
the homogenate by standard vaccinia virus plaque 
5 assay. The results of such an experiment for the 

recombinant virus v9a.l (HPV infromation inserted 
at site A) are shown in Figure 25. It can be seen 
that the recombinant virus retains the ability to 
replicate in mice , and that the level of virus 
10 produced in the lungs of the infected animals is 

similar to that seen with the parental Wyeth 
strain. 



Therapeutic Use 

A stock of the recombinant virus is prepared by 
15 infection of MRC5 cells, and adjusted to a 

concentration of not less than 10 8 pfu/ml. 20 pi 
of this material is applied to the arm of the 
patient, which is then scarified through the virus 
droplet with a bifurcated needle, according to the 
20 standard procedure used for vaccination against 

smallpox. 



WO 92/16636 



PCT/GB92/00424 



43 

CLAIMS 

1. A recombinant virus vector for use as an 
immuno therapeutic or vaccine which comprises at least 
one pair of nucleotide sequences heterologous to said 
5 virus which have sufficient sequence homology that 
recombination between them might be expected ; 

wherein said pair of nucleotide sequences are 
arranged in said virus vector such that they are 
inverted with respect to each other to reduce the 
10 likelihood of recombination events leading to loss of 
part or all of said sequences and said virus vector is 
able to infect a mammalian host cell and express as 
polypeptide the heterologous nucleotide sequences in 
said host cell. 

15 2. A recombinant virus vector according to claim 1 
wherein at least one pair of nucleotide sequences 
encode part or all of human papillomavirus (HPV) wild- 
type proteins or mutant proteins immunologically 
cross -reactive with said wild-type proteins. 

20 3. A recombinant virus vector according to claim 1 
or claim 2 wherein the pair of nucleotide sequences 
encode part or all of the HPV wild-type proteins 
HPV16E7 and HPV18E7 or mutant proteins immunologically 
cross-reactive therewith. 
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4. A recombinant: virus vector according to claim 1 
or claim 2 wherein the pair of nucleotide sequences 
encode part or all of the HPV wild- type proteins 
HPV16E6 and HPV18E6 or mutant proteins immunologically 

5 cross -reactive therewith, 

5. A recombinant virus vector according to claim 3 
which comprises a further pair of nucleotide sequences 
which encode part or all of the HPV wild-type proteins 
HPV16E6 and HPV18E6 or mutant proteins immunologically 

10 cross-reactive therewith. 

6. A recombinant virus vector according to any one 
of the preceding claims wherein two or more nucleotide 
sequences of different said pairs may be fused 
together to form a single open reading frame. 

15 7. A recombinant virus vector according to claim 6 
wherein the fusions are via a single codon encoding a 
small neutral amino acid. 

8. A recombinant virus vector according to claim 7 
wherein the amino acid is glycine. 

20 9 . A recombinant virus vector according to claim 5 
in which the pairs of nucleotide sequences are 
arranged in the virus vector according to any one of 
the options as shown in Figure 26. 
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10. A recombinant virus vector according to claim 9 
which comprises 

a first open reading frame having a fused genetic 
sequence encoding part or all of the wild-type 
5 proteins E6 and E7 from HPV16; and 

a separate second open reading frame having a 
fused genetic sequence encoding part or all of the 
wild-type proteins E6 and E7 from HPV18; 

wherein the first and second open reading frames 
10 may be inverted with respect to one another whereby 
either: 

i) the E6 coding sequences of HPV16 and HPV18 
are both located between the E7 coding sequences of 
HPV16 and HPV18; or 
15 ii) the E7 coding sequences of HPV16 and HPV18 

are both located between the E6 coding sequences of 
HPV16 and HPV18; and 

wherein any of said wild-type proteins may be 
replaced by a mutant protein immunologically cross - 
20 reactive therewith. 

11. A recombinant virus vector according to claim 10 
wherein each of the first and second open reading 
frames has a corresponding promoter and the two open 
reading frames each with its promoter, are arranged 

25 next to each other in the virus. 

12. A recombinant virus vector according to claim 11 
wherein either: 
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i) the promoters are located between the first 
and second open reading frames whereby the open 
reading frames are transcribed outwardly; or 

ii) the promoters are located outside the first 
5 and second open reading frames whereby the open 

reading frames are transcribed inwardly. 

13. A recombinant virus vector according to any one 
of claims 9 to 12 which comprises 

a first open reading frame having a fused genetic 
1 0 sequence encoding part or al 1 of the wi Id - type 
proteins E6 and E7 from HPV16; and 

a separate second open reading frame having a 
fused genetic sequence encoding part or all of the 
wild-type proteins E6 and E7 from HPV18; 
15 wherein the E6 coding sequences of HPV16 and 

HP VI 8 are both located between the E7 coding sequences 
of HPV16 and HP VI 8 ; and 

each open reading frame has a corresponding 
promoter, the promoters being located between the 
20 first and second open reading frames whereby the open 
reading frames are transcribed outwardly; and 

wherein any of said wild-type proteins may be 
replaced by a mutant protein immunologically cross- 
reactive therewith. 

25 14. A recombinant virus vector according to any one 
of the preceding claims wherein either or both of the 
nucleotide sequences in a said pair of nucleotide 
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sequences are altered to make them less homologous 
than an equivalent pair of nucleotide sequences 
encoding wild-type HPV proteins. 

15. A recombinant virus vector according to claim 14 
5 wherein the alteration in nucleotide sequence does not 

result in an alteration of the encoded amino acid 
sequence . 

16. A recombinant virus vector according to claim 3 
wherein the wild-type proteins HPV16E7 and HPV18E7 are 

10 replaced with mutant proteins which are substantially 
homologous to said wild -type proteins and in which the 
residues cys 24 and glu 26 of wild-type protein 
HPV16E7 and the residues cys 27 and glu 29 of wild- 
type protein HPV18E7 are replaced with glycine 

15 residues. 

17. A recombinant virus vector according to any one 
of the preceding claims wherein said heterologous 
nucleotide sequences may comprise part or all of the 
sequences shown in Figures 1(a) and 1(b). 

20 18. A recombinant virus vector according to any one 
of the preceding claims which is derivable from 
vaccinia virus. 

19. A recombinant virus vector according to any one 
of the preceding claims wherein the nucleotide 
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sequences are inserted into "the virus vector at one or 
more neutral sites, the disruption of which by the 
insertion of the nucleotide sequences does not 
substantially adversely affect viral functions 
5 relating to the replicative ability of the virus in 
the mammalian cell. 

20 • A recombinant virus vector according to claim 19 
which is derivable from vaccinia virus and wherein the 
neutral sites may be one or more of: 

A ) the gap between SalIF17R and SalIF19R of 
strain WR comprising at least part of the sequence 
CTATCTACCAGATTATTATGTGTTATAAGGTACTTTTTCT; 

B) the gap between SalIF19R and SalIF20.5R of 
strain WR comprising at least part of the sequence 
TATTGTGCTACTGATTCTTCACAGACTGAAGATTGTTGAA; 

C) a region in SalIG2R of strain WR comprising 
at least part of the sequence 

TCTCTTAAAATGGTTGAGACCAAGCTTCGTTGTAGAAACA; 

D) a region in HindB3.5R of strain WR comprising 
at least part of the sequence 
TGAGGCTACCTCGACATACGTGTGCGCTATCAAAGTGGAA; 

E) a sequence having at least 90% sequence 
homology to those sequences A) to D) identified above. 

21. A method for making a recombinant virus vector 
25 according to claim 19 or claim 20 which comprises 
inserting a said heterologous nucleotide sequences into 
one or more neutral sites in a virus vector, the 
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disruption of such a site by said insertion will not 
substantially adversely affect the replicative ability 
of the virus and wherein the neutral site has been 
previously identified by: (a) analysing a viral genome 
5 to identify open reading frames which are likely to 
encode functional genes; and (b) selecting sites between 
open reading frames for functional genes or sites within 
sequences for non- functional genes. 

22. A recombinant virus vector obtainable by the method 
10 of claim 21. 

23. A method which comprises using a recombinant virus 
vector according to any one of claims 1 to 20 or to 
claim 22 to manufacture a medicament for use as an 
immuno therapeutic or vaccine against a condition thought 

15 to be caused by HPV infection , for example cervical 
cancer . 

24. A method which comprises using a recombinant virus 
vector according to any one of claims 1 to 20 or to 
claim 22 to specifically activate cells of the immune 

20 system to HPV proteins. 
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TflGflGTflTGflGTflflGflTflGTGTTTTTCflflflRCflTflTflflflTCTRflflflTTGflTGGflTGflGflT 



flTflCflGCTflTTflflTTTCGflflflflTRTRTTTTRflTCTGflTflRCTTTflflflCflTGGflTTTTTGfl 

S a I F 1 9 R 

TGGTGGTTTflfiCGTTTTflflflflflflflGRTTTTGTTflTTGTRGTflTflTGflTflflTflTTflflflRGfl 

n I I L K D 

TGGflTRTflflflGflflTTTGCTGflCTGCflTGTflCTflTTTTTTflCflTTRCTRCflTTGGCTflCGG 

CV.KEFflOCflVVFLHVY I GVG 
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Fig. 10. 



So I F20R 

TGCCflflGGTTflGflTGTflflTGGTflflCGfiTflflCflCflflflflTGTGflflCGCTGCCCnCCTCRTflC 

flKu nCMGMDHT>:CERCPPHT 

flTflTflCCflCflflTCCCflflTTfiTTCTflflTGGflTGTCflTCflflTGTflGflflfiflTGCCCflflCCGGfl 
V T T 1 P 1 1 L n 0 U I M u E H fl 0 P D 

TCflTTTGflTflflGGTflflflGTGTflCCGGflflCfiCflGflHCflGTflflflTGTTCGTGTCTTCCTGGT 
H L I R n 

Site B 

TGGTflTTGTGCTflCTGfiTT£H£fl£.fiGflCTGflflGflTTGTTGflflflTTGTGTflCCflflflHflGG 

Sol F20 . 5R 

fl GflTGTCCflTGCGGflTflCTTTGGTGGflfiTflGHTGRRCflflGGflflflTCCTflTTTGTflflflTCG 

flNKEILFUNR 

TGTTGTGTTGGTGflflTflTTGCGflCTRCCTRCGTflflTTflTflGflCTTGflTCCflTTTCCTCCn 

U U L U H I fl T T V U I I D L I H F L H 

TGCflflflCTflTCTHflfiTGTRRTTRflTTflTGfiTTTTGflTGflTflflTGTTflCCflTflCflTTflTflT 
flMVLH UlMVDFOOHUTlHVI 

CGCTflCTTGGTTflGTGTRTTflTTCflGTflTGflflGflCCTflTTflflTflflTTflCTTflTCTTTTGfl 1560 
ATULUVVSU" 

CGflTCTTGTTflTflflTTflTflflTflTflflflflflTRCTTflTGGCflTRGTflflCTCflTflflTTGCTGflC | 620 
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Fig. 11. 



Gap Ueight. 3.000 Average Hatch. 0.540 

Length lieight. 0.100 fluerage nismatch. -0.396 

Quality. 113.0 Length. 199 

Ratio. 0.608 Gaps. 4 

Percent Similarity. 56.354 Percent Identity*. 29.232 

The top sequence is SalG2R and the bottom sequence is the yeast GK 



Start of G2R 

1 flSGIUKSI ILS GPSGLGKTR IflKRLflGIV. . . LDLUCP I PLOFLULttERE 4 7 



1 SRPIUIS GPSGTGKS TLLKKLFREVPOSFGFSUSSTTRTPRRGEUN -16 

48 GUDVHVUHRERIUKG I flflGNFLEHTEFLGN I YGTSKTflUNTflfl I HHRI CU 97 

47 GKOVHFUSUDEFKSniKHHEF lEUROFSGNYYGSTUflSUKOUSKSGKTC I 96 

98 nDLN I OGURSLKN . TVLflPVSUV I RPTSDCflUETJCLRCRHTERHOE I HRR 1 46 

97 LD I OMQGUKSUKfl I PELNflRFLF I RPP S UED LKKR LEG R GTE TEES I NKR 1 46 

147 UlLRICTOnDEflNEflGLFDTI I I EQOUNLRYSKL I 0 • I LQOR I RI1VFNTH 194 

147 LSRRQRELRYfl . ETGRHOKU I UNOOLDKRVKELKDF IFflEK 1 86 



SUBSTITUTE SHEET 

BWSOOC1D: <WO 991 0636*. 1_l_> 



WO 92/16636 



PCT/GB92/00424 



Fig. 12. 



HlndB3R 

CTAAGAACACGTATACGGCAGCAGCTTCCT7TATACTCTCATCTTTTACCAACACAAAGG 507 
LRTRIRQQLPLYSHLLPTQR 

GTGGATATTTGTTCATTGGAGTTGATAATAATACACACAAAGTAATTGGATTCACGGTGG 567 
VDICSIELIIIHTK* 

GTCATGACTACCTCAGACTGGTAGAGAATGATATAGAAAAGCATATCAAAAGACTTCGTG 627 



TTGTGCATTTCTGTGAGAAGAAAGAGGACATCAAGTACACGTGTCGATTCATCAAGGTAT 



I I I I I I 



687 



Site D 

ATAAACCTGGGGATGAGGCTACCTCGACAIAX5IG.TGCGCTATCAAAGTGGAAAGATGCT 747 
GTTGTGCTGTGTTTGCAGATTGGCCAGAATCATGGTATATGGATACTAATGGTATCAAGA 807 
AGTATTCTCCAGATGAATGGGTGTCACATATAAAATTTTAATTAATGTAATAGAGAACAA 867 
ATAATAAGGTTGTAATATCATATAGACAATAACTAACAATTAATTAGTAACTGTTATCTC 



927 



AACTAACCAACTAACTATATACCTATTAATACATCGTAATTATAGTTCTTAACA 987 



TCTATTAATCATTAATTCGCTTCTTTAAT7Trrr>jrAAACTAACATTGTTAATTGAAAAG 1047 

HindB4R 

GGATAACATGTTACAGAATATAAATTATATATGGATTTTTTTAAAAAGGAAATACTTGAC 1107 

MDFFKKEILO 

TGGAGTATATATTTATCTCTTCATTATATAGCACGCGTGTTTTCCAATTTTTCCACATCC 1167 
WS IYLSLHYIARVFSNFSTS 
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Hindill EcoRI 



SnaBi Hpal Hindlll Sail Ncol Smal SnaBI 




Hindlll 
encs 



Synthetic oligo 
S7-S8 




SnaBf Hpai Hindlll Sail Ncol Smal SnaBI 



pH6 promoter 



PCR product digested 
with Sail and Hindlll 



pUC118 

mmmm^m ^ 

ampR 



with Hindlll and EcoRI 




p7.5 promoter 



PCR product digested 
with Sail ano Ncol 



SnaBI Hpal Hindlll Sail Ncol SmalSnaB 




pH6 

plMMB4a 



Ulx.-A.U.'.-. ' .'. i .W 



SnaBI Hpal Hindlll Sail Ncol Smal SnaBI 
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Hindm SalI NcoI 



SnaBI^ 3 ^ /^SnaBI 



w 

pIMMBUb 
ampr 



HPV16 E67 



Hindm/Smal 
fragment - from 

plMS7.2 



Digest with 
Hindlll/Hpal 



Digest with 
Ncol/Smal 



SnaBI 



Sail 

t HindHI 
I HPV16 E67 I , 



-SnaBI 



HindHI 
SnaBI H P aI 



HPV18E67 

V ////////A 

Ncol /Smal 
fragment from 
PIMS8.2 



Sail 
Ncol 



plMS12 
ampr 




Smal 



HPV18 E67 
'///////, 



SnaBI 




plMS13 
ampr 



Digest with 
Sail and Smal 



Sail 



SnaBI HPV16 CT^SnaBI 



plMS12 
ampr 



^miSmalc- 



Digest with 
Sail and Smal 

t 

Sail 
Ncol 



Smal 
, HPV18 E67 



" Sail 
c DT Hindin Ncol 

SnaBI HPV16 E67 



Smal 



Fig. 15. 



^nabi HPV16E67 I ^ IHPV18 E67 , 



SnaBI 



plMS15 



amDr 
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Fig. 16 



SnaBI Hpal 



Sail 

Hmdlll I Ncoi 




Smal SnaBI 
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w^mzzzzzzzzt 



HPV16E ^ 7 pH6p7.5 HPV18&7 



PIMS15 



ampR 



Cut with SnaBI 
Isolate E6-7 
fragment 



Unique srte 
I 



neutral site 

plMMB8 a-d 



ampR 



Cut at unique site 



Hpal 



Sail 
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Smal 



srtelef! HPV16 E6 - ? P H6 P 7 - 5 HPV 18 E5 - 7 
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Fig. 17. 



plMMB9 a-d 



i u HPV16E67 m-vi 



HPV18E67 



homologous 
recombination 





-033233333333- 
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Recombination 
plasmtd 



Virus genome 
— - — — - containing neutral 
site 



Recombination between 
plasmid and virus genome. 
Recombinant selected by probing 
with HPV-speific sequences. 
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Fig. 18. 

MB 0 3 TCAGGATCCCACATGAGCGAAAAATACATCG 
MB07 TOVAAGCTTATTACGATACAAACTTAACGGA 
MB 0 9 TCAGTTAACATAAAAAGAACAACGCCCGGCAG 
MB1 4 TCAAGGGCCTCTATATAGTAATACCAATACTC 
MB 1 5 TCAGTCGACTTACAAACAACTAGGAAATTGG 
MB1 6 TCAGAATTCTATGTACAGAGGTCTATTAGGC 

MB17 TCAAAGCTTGTATGAGGTGGGCAGCGTTCAC 

MB1 8 TCAGAATTCTTAATTATATTGTCGGCCGTGG 

MB1 9 TCAGCATGCATGATCCGTTAGCTTTGGGCTC 

MB22 TCAGAATTCGAAGCTCTAGAGTATCTTAGCG 

MB 2 3 TCAAAGCTTTCCTGTATTATATGGGATGTGG 

MB24 TCAGAATTCATTGATGGATGAGATATACAGC 

MB25 TCAAAGCTTTCACAAAATCG 

MB2 6 TCAGAATTCCACGTATACGGCAGCAGCTTCC 

MB2 7 TCAAAGCTTTGTTCTACGTCCATTTTCAAGC 

MB32 TCAGTCGACATACCAATACTCAAGACTACGA 

MB33 TCACCATGGATTGCTATTGATTGAGTACTGTTC 

MB 3 5 AGTACCTTATAATACGTAATAATCTGGTAG 

MB 3 6 AATCTTCAGTCTGTTAACAATCAGTAGCAC 

MB 3 7 TACAACGAAGCTAGGCCTCAACCATTTTAA 

MB38 TTTGATAGCGCATAC GTATGTCGAGGTAGC 

MB 3 9 AATAGGTATATAGTTAACTGGTTAGTTAAA 

501 ATCCCATGGCGCGCTTTGAGGATCCAAC 

502 TCACCCGGGTTACTGCTGGGATGCACACCAC 

s 0 5 ATCCCATGGACCAAAAGAGAACTGCAATGTTTC 

SO 6 TCACCCGGGTTATGGTTTCTGAGAACAGATG 

507 AGCTCTACGTAGTTAACAAGCTTGTCGACCCATGGCCCGGGTACGTA 

508 AATTTACGTACCCGGGCCATGGGTCGACAAGCTTGTTAACTACGTAG 
S2 0 GAAACCCAGCTGGGAATCATGCATGG 

521 GAAACACAAGTAGGAATATTAAGTATG 

522 GATCTCTACGGTTATGGGCAATTAAATGAC 
S 2 3 GACCTTCTAGGTCACGGGCAATTAAGCGAC 

S2 4 ATGTATAGATTTCTAC AGTAGAATCAGAGAATTAAG 
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TATTGTTTACGAGCTATTAAACTGTTTATTAATC^ 6 q 

YCLRAIKLF INDHMLDKIKS 
IVYELLNCLLMITCLIR*NL 
LFTSY*7VY* * S H A * * D K I Y 

ATACTGCAGAATAGACTAGTATATGTGGAAATGTCATAGAAAGTTAAAAGTTAATGAGAG 1 2 0 

ILQNRLVYVEMS*KVKS**E 
YCRID *YMWKCHRKLKVNES 
TAE *TSI-CGN"VIES * KLMRA 

CAAAAATATATAAGGTTGTATTCCATATTTGTTATTTTTTCTGTAA 180 
QKYIRLYSIFVIFSVIVRKI 
KNI*GCIPYLL?FL**LEKY 
KI YKVVFHICYFFCNS * K N T 

CATTCGATGGTCTATCTACCAGATTATTATGTG1TATAAGGTACTTTTTCTCATAATAAA 24 0 

HSMVYLPDYYVL*GTFS HNK 
IRWSIYQIIMCYKVL-FLIIN 
FDGLSTRLLCVIRYF FS* * T 

CTAGAGTATGAGTAAGATAGTGTTTTTCAAAACATATAAATCTAAAATTGATGGATGAC^ 300 
LEYE*DSVFQNI»I*N*WMR 
* SMSKIVFFKTYKSKIDG* D 
RV*V R*CFSKHINLKLMDEI 

TATACAGCTATTAATTTCGAAAATATATTTTAATCTGATAACTTTAAACATG 360 
YTAINFEN IF*SDNFKHGFL 
IQLLISKIYFNLITLNMDF* 
YS Y * F RKYI L I * -* L * TWIFD 

ATGGTGGTTTAACGTTTTAAAAAAAGATTTTGro 4 20 

WVV*RFKKRFCYCSI**Y*K 
W W F N V L K K D F V I V V Y D N : K R 
GGLTF'KKILLL'YMI ILKD. 

ATGGATATAAAGAATTTGCTGACTGCATGTACTATTTTTTACATT^ 480 
MDIKNLLTACTIFYITTLAT 
W I *RIC*LHVLFFTLLHW1R 
GY K EFADCMYYFLHY Y IGY G 

GCAGATATACCTACTCCGCCACCAACGGGTCATGTGACAAGGGAGAATATCTTGATAAGA ' 540 

ADIPTPPPTGHVTRENILIR 
Q I Y LLRHQRVM * QGR I S * * ~ 
RYTYSATNGSC DKGE YLDK ~R 

GGCATAATCAATGTTGTAATCGGTGTCCACCTGGAGAATTTGCCAAGGTTAGATGTAATG 600 
GI INVVIGVHLENLPRLDVM 
A»SML»SVSTWRICQG*M-W 
HNQCCNRCPPGEFAKVRCNG 

GTAACGATAACACAAAATGTGAACGCTGCCCACCTCATACATATACCACAATCCCAATTA 660 
V TI7QNVNAAHLIHIPQSQL 

* R w hkm*tlptsyiyhnp"ny 

NDNTKCERC-FPHTYTTIPI I 
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960 



TTCTAATGGATGTCATCAATGTAGAAAATGCCCAACCGGATCATTTGATAAGGTAAAGTG 720 
F*WMSSM*KMPNRII».GKV 
SNGCHQCRKCPTGSFDKVKC 
LMDVI NVENAQPDHL I R « S V 

TACCGGAACACAGAACAGTAAATGTTCGTGTCTTCCTGGTTGGTATTGTGCTACTGATTC 780 

yrnteq*mfvsswlvlcy* r 
tgtqnskcsclpgwycatd's 

PEHRTVNVRVFLVGIVLLIL 

TTCAC^GACTGAAGATTGTTGAAATTGTGTACCAAAAAGGAGATGTCCATGCGGATACTt 840 
FTD*RLLKLCTKKEMSMRIL 
SQTEDC*NCVPKRRCPCGYF 
HRLKIVEIVYQKGDVHADTL 

TGGTGGAATAGATGAACAAGGAAATCCTATTTGTAAATCGTGTTGTGTTGGTGAATATTG 900 
WWNR«TRKSYL'IVLCW»IL 
GGIDEQGNPICKSCCVGEYC 
VE "MNKEILFVNRVVLVNIA 

CGACTACCTACGTAATTATAGACTTGATCCATTTCCTCCATGCAAACTATCTAAATGTAA 
RLPT*L»T»SISSMQTI«'M» 
DYLRNYRLDPFPPCKLSKCN 
TTYVIIDLIHFLHANYLNVI 

TTAATTATGATTTTGATGATAATGTTACCATACATTATATCGCTACTTGGTTAGTGTATT 1020 
LIMILMIMLPYIISLLG-CI 
*L*F*»»CYHTLYRYLVSVL 
NYDFDDNVTIH YIATWLVYY 

ATTCAGTATGAAGACCTATTAATAATTACTTATCrTTTGACGATCTTGTTATAATTATAA ^ 080 

IQYEDLLIITYLLTILL-L- * 
FSMKT' Y** LI,IF»RSCYNYN 
SV«RPlNNYLSFDDLVIIii, 

TATAAAAATACTTATGGCATAGTAACTCATAATTGCTGACGCGATAAATTCGTAATAATC 114 0 

YKNTYGIVTHNC*RDKFVI I 

IKrLMA»»LIIADAINS»»S 
KYLWHSNS » L, L T R * I RNNL 

TGTTTTGTTCAAATTTTTATAAGGAATCrAC^GGCATAAAAATAAAAATATAATTTATAA 1200 
CFVQIFIRNLQA»K'KYNT - 

vlf?cfl*giyrkknknii~'yn 

FCSNFYKESTGIKIKI * F I ~ 

TATACTCTTACAGCGCGCCATCATGAATAACAGCAGTGAATTGATTGCTG 1 2 6 0 

YTLTARHHE'QQ-IDC 
ILLQRAIMK N5SELIA 
YSYSAPS' ITAVN'LL 
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Fig. 20. 

ATATTTGGTATTACCGCATTAATTATATT^ 6 0 

I F GITALI ILSAVAIFC ITY 
YLVLPH* LYCRPWQFSVLH I 
IWYYRINYIVGRGNFLYYIL 

TATATATATAATAAACGTTCACGTAAATACAAAACAGAGAACAAAGTC^^ 120 
YIYNKRSRKYKTEN KV* IFD 
IYIINVHVNTKQRTKSRFLT 
YI**TFT*IQNREQSLDF*L 

TTACATAAATGTCTGGGATAGTAAAATCTATCATATTGAGCGGACCATCTGGTTTAGGAA ISO 
LHKCLG* *NLSY*ADHLV* E 
YINVWDSKIYHIERTIWFRK 
T*MSGIVKSIILSGPSGLGK 

AGACAGCC^TAGCCAAAAGACTATGGGAATATATTTGGAT^ 240 
RQP*PKDYGNIFGFVVS HTT 
DS HSQKTMGIYLDLWCPIPL 
TAIAKRLWEYIWICGVPYH * 

AGATTTCCTCGTCCTATGGAACGAGAAGGTGTTGATTACCATTACGTTAACAGAGAGGCC 300 
RFP RPMEREGVDYHYVNREA 
DFLVLWNEKVLITITLTERP 
ISSSYGTRRC*LPLR*QRGH 

ATCTGGAAGGGAATAGCCGCCGGAAACnTTCTAGAACATACTGA Gl w lT lT AGGAAATATT 360 
IWKGIAAGNFLEHTEFLGNI 
SGRE*PPETF*NILSF*EIF 
LEGNS R'RKLSRTY * V F RKY L 

TACGGAACTTCTAAAACAGCTGTGAATACAGCGGCTATTAATAATCGTATTTGTGTGATG 420 
Y GTSKTAVNTAAINNRI CVM 
TELLKQL* IQRLLIIVFV-W 
RNF*NSCEYSGY* *SYLCDG 

GATCTAAACATCGACGGTGTTAGAAGTCTTAAAAATACGTACCTAATGCCTTACTCGGTG 4 80 

DLNIDGVRSLKNTYLMP YSV 
I *TSTVLEVLKIRT*CLTRC 
SKHRRC* KS*KYVPNALLGV 

TATATAAGACCrACCTCTCTTAAAATGGTTGAGACCAAGCTTCGTTGTAGAAACACTGAA 540 
YIRPTSLKMVETKLRCRNTE 
Z *DLPLLKWLR?SFVVETLK 
YKTYLS*NG*DQA£L*KH*S 
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Fig. 20 (Con t ). 



600 



GCTAACGATGAGATTCATCGTCGCGTGATATTGGCAAAAACGGATATGGATGAGGCCAAC 
A N D E I KRRVILAKTDMDEAN 
-TMRFIVA-YWQKRIWMRP- 

-r»dsssrd:gkngyg*gqr 

GAAGCAGGTCTATTCGACACTATTATCATTGAAGArGATGTGAATTTAGCATATAGTAAG 6 6 0 

c-AGLFDTI I I E D D V N t AYSK 

KQVYSTLLSLKMM - I - H IV S 

SRSIRHYYH'R »CEFSI * "V 

TTAATTCAGATACTACAGGACCGTATTAGAATGTATTTTAACACTAATTAGAGACTTAAG 720 

liqilqdrirmyfntn* rlk 
• fryyrtvleciltlirdlr 

NSDTTGPY*NVF*H-LET-D 
ACTTAAAACTTGATAATTAATAATATAACTCGTTTTTATATGTGGCTATTTCAACGTCTA 780 

t * n liinn itrfymwlfqrl 

LKT" * L I I tr LVFICGYFNV* 
LKLDN**YNSFLYVAISTSN 

ATGTATTAGTTAAATATTAAAACTTACCACGTAAAACTTAAAATTTAAAATGATATTTCA 8 4 0 

MY*LNIKTYHVKLKI*NDIS 
CIS *ILKLTT*NLKFKMIFH 
VLVKY*NLPRKT«NLK - YFI 

TTGACAGATAGATCACACATTATGAACTTTCAAGGACTTGTGTTAACTGACAATT 
LTDRSHIMNFQGLVLTDNCK 
*QIDHTL*TFKDLC*LTIAK 
DR-ITHYELSRTCVN'QLQK 

AATCAATGGGTCGTTGGACCATTAATAGGAAAAGGTGGATTCGGTAGTATTTATACTACT 
NQWVVGPLIGKGGFGStv-- 

INGSLDH*»EKVDSVVF T *L L 
SMGRWTI NRKRWI R * Y L ""y Y • 

AATGACAATAATTATGTAGTAAAAATAGAGCCCAAAGCTA 
NDNNYVVKIEPKA 
MTIIM**K*S?KL 
*Q*LCSKNRAQS*'s 
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Fig. 21. 



ACCATCGAGGTAACCACCTCTCTGGAAGACAGCGTGAATAATGTACTCATGAAACGTTTG 60 
TIEVTTSLEDSVNNVLMKRL 
PSR*PPLWKTA* IMYS * N V W 
HRGNHLSGRQRE *CTHETFG 

GAAACTATACGCCATATGTGGTCTGTCGTATATGATCAT1TTGATATTGTGAATGGTAAA 120 
ETIRHMWSVVYDHFDIVNGK 
KLYAICGLSYMI ILIL * M V K 
NYTPYVVCRI* SF*YCEW*R 

GAATGCTGTTATGTGCATACGCATTTGTCTAATCAAAATCTTATACCGAGTACTGTAAAA 180 
ECCYVHTHLSNQNLIPSTVK 
NAVMCIRICLIKILYRVL *K 
MLLCA YAFV*SKSYTEYCKN 

ACAAATTTGTACATGAAGACTATCGGATCATGCATTCAAATC 240 
TNLYMKTMGSCIQMDSMEAL 
QICT*RLWDHAFKWIPWKL* 
KFVHEDYGIMHSNGFHGSSR 

GAGTATCTTAGCGAACTGAAGGAATCAGGTGGATGGAGTCCCAGACCAGAAATGCAGGAA 3 00 

EYLSELKESGGWSPRPEMQE 
S I LAN * RNQVD GVP D Q K CRN 

VS*RTEGIRWMESQTRNAGI 

TTTGAATATCCAGATGGAGTGGAAGACACTGAATCAATTGAGAGATTGGTAGAGGAGTTC 360 
FEYPDGVEDTES IERLVEEF 

lniqmewktlnql rdw * rs s 
* isrwsgrh*in*eigrgvl' 

ttcaatagatcagaacttcaggctggtgaatcagtcaaatttggtaattctat^ 420 

FNRSELQAGESVKFGNS I N T V 
SIDQ NFRLVNQSNLVI LLML 
Q*IRTSGW»ISQIW*FY*C* 

AAACATACATCTGTTTCAGCTAAGCAACTAAGAAC^CGTATACGGCAGCAGCTTCCTTTA 480 
KHTSVSAKQLRTRIRQQLPL 
NIHLFQLSN'EHVYGSSFLY 
TYICFS*ATKNTYTAAA .SFI 

TACTCTCATCTTTTACCAACACAAAGGGTGGATATTTGT^ 54 0 

YSHLLPTQRVDICSLELIII 
TLIFYQHKGWIFVHWS * * * Y 
LSSFTNTKGGYLFIGVDNNT 

CACACAAAGTAATTGGATTCACGGTGGGTCATGACTACCTCAGACTGGTAGAGAATGATA 600 
H7K*LDSRWVMTTSDW*RM" 

TQSNWIHGGS*LPQTGRE**~Y 
HKVJ GFTVGHDYLRLVENC*! 

TAGAAAAGCATATCAAAAGACTTCGTGTTGTGCATTTCTGTGAGAAGAAAGAGGACATCA 660 
*KSISKDFVLCI SVRRKRT C 
RK AYQKTSCCAFL* EE RGHQ 
EK. HIKRLRVVHFCEKKEDIK 
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Fig.21(Cont 1). 



720 



AGTACACGTGTCGATTCATCAAGGTATATAAACCTGGGGATGAGGCTACCTCGACATACG 
STRVDS SRYINLGMRLPRHT 
VHVSIHQGI*TWG»GYLDIR 
YTCRFIKVYKPGDEATSTYV 

TGTGCGCTATCAAAGTGGAAAGATGCTGTTGTGCTGTGTTTGCAGATTGGCCAGAATCAT 7Sn 
CALSKWKDAVVLCLQIGQNH 
VRYQSGKMLLCCVCRLA RI M 
CAIKVERCCCAVFADWPESW 

GGTATATGGATACTAATGGTATCAAGAAGTATTCTCOVGATGAATGGGTGTCACATATAA 8 4 n 

GIWILMVSRSILQMNGCHI* 
VYGY*WYQ£VFSR*MGVTYK 
YMDTNGIKKYSPDEWVSHIK 

AATTTTAATTAATGTAATAGAGAACAAATAATAAGGTTGTAATATCATATAGACAATAAC 

N FN-CNREQIIRL-YHIDNN 
ILINVIENK*-GCNII* TI T 

F*LM**RTNNKVVISYRQ. L 

TAACAATTAATTAGTAACTGTTATCTCTTTTTTAACTAACCAACT 
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